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Abstract. The syntactic complexity of a regular language is the cardi- 
nality of its syntactic semigroup. The syntactic complexity of a subclass 
of regular languages is the maximal syntactic complexity of languages 
in that subclass, taken as a function of the state complexity of these 
languages. We study the syntactic complexity of star-free regular lan- 
guages, that is, languages that can be constructed from finite languages 
using union, complement and concatenation. We find tight upper bounds 
on the syntactic complexity of languages accepted by monotonic and par- 
tially monotonic automata. We introduce "nearly monotonic" automata, 
which accept star-free languages, and find a tight upper bound on the 
syntactic complexity of languages accepted by such automata. We conjec- 
ture that this bound is also an upper bound on the syntactic complexity 
of star-free languages. 

Keywords: finite automaton, monotonic, nearly monotonic, partially 
monotonic, star-free language, syntactic complexity, syntactic semigroup 



1 Introduction 

The class of regular languages is the smallest class containing the finite lan- 
guages and closed under union, concatenation and star. Since regular languages 
are also closed under complementation, one can redefine them as the smallest 
class containing finite languages and closed under boolean operations, concate- 
nation and star. Star-free languages constitute the smallest class containing the 
finite languages and closed under boolean operations and concatenation. In 1965, 
Schiitzenberger proved [18] that a language is star-free if and only if its syn- 
tactic monoid is group-free, that is, has only trivial subgroups. An equivalent 
condition is that the minimal deterministic automaton of a star-free language is 
permutation-free, that is, has only trivial permutations. Another point of view is 
that these automata are counter-free, since they cannot count modulo any integer 
greater than 1. They can, however, count to a threshold, that is 0, 1, . . . , (n— 1 or 
more). Such automata are called aperiodic, and this is the term we use. Star- free 
languages were studied in detail in 1971 by McNaughton and Papert [14]. 
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The state complexity of a regular language is the number of states in the 
minimal deterministic finite automaton (DFA) recognizing that language. The 
state complexity of an operation on languages in a subclass of regular languages 
is the maximal state complexity of the language resulting from the operation 
taken as a function of the state complexities of the arguments in that subclass. 
State complexity of regular operations has been studied quite extensively; for a 
survey of this topic and a list of references see [20] . 

An equivalent notion to state complexity is that of quotient complexity [2], 
which is the number of left quotients of the language; we prefer to use the term 
quotient complexity. 

The notion of quotient complexity can be derived from the Nerode equiv- 
alence [16], while the Myhill equivalence [15] leads to the syntactic semigroup 
of a language and to its syntactic complexity, which is the cardinality of the 
syntactic semigroup. It was pointed out in [5] that syntactic complexity can be 
very different for regular languages with the same quotient complexity. 

In contrast to state complexity, syntactic complexity has not received much 
attention. In 1970 Maslov [13] dealt with generators of the semigroup of all 
transformations of a finite set in the setting of finite automata. In 2003-2004, 
Holzer and Kdnig [9], and independently, Krawetz, Lawrence and Shallit [12] 
studied the syntactic complexity of automata with unary and binary alphabets. 
In 2010 Brzozowski and Ye [5] examined ideal and closed languages, and in 2011 
Brzozowski, Li and Ye [3] studied prefix-, suffix-, and bifix-free languages and 
their complements. 

Here we deal with star-free languages. It has been shown in 2011 by Brzo- 
zowski and Liu [4] that boolean operations, concatenation, star, and reversal in 
the class of star-free languages meet all the quotient complexity bounds of regular 
languages, with a few exceptions. Also, Kutrib, Holzer, and Meckel [10] proved 
in 2011 that in most cases exactly the same tight state complexity bounds are 
reached by operations on aperiodic nondeterministic finite automata (NFA's) as 
on general NFA's. In sharp contrast to this, we show that the syntactic complex- 
ity of star- free languages is much smaller than that of regular languages, which 
is n" for languages with quotient complexity n. We derive tight upper bounds 
for three subclasses of star-free languages, the monotonic, partially monotonic, 
and nearly monotonic languages. We conjecture that the bound for star-free 
languages is the same as for nearly monotonic languages. 

The remainder of the paper is structured as follows. Our terminology and 
some basic facts are stated in Section 2. Aperiodic transformations are exam- 
ined in Section 3. In Section 4, we study monotonic, partially monotonic, and 
nearly monotonic automata. In Section 5 we derive a tight upper bound on 
the syntactic complexity of star- free languages having quotient complexity 3. 
Section 6 concludes the paper. 
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2 Preliminaries 



We assume that the reader is familiar with basic theory of formal languages as 
described in [17], for example. Let E be a non-empty finite alphabet and E* , 
the free monoid generated by E. A word is any element of E* , and the empty 
word is e. The length of a word w G E* is |to|. 

A language over E is any subset of E* . For any languages K and L over 
E, we consider the following boolean operations: complement (L = E* \ L), 
union (KUL), intersection (Kf)L), difference (K\L), and symmetric difference 
(K © L). The product, or (con) catenation, of K and L is KL = {w G E* \ w = 
uv, u G K,v G L); and the star of L is L* = IJijso 

Regular languages are defined by induction: For the basis, the languages 0, 
{e}, and {a} for any a G E are regular. Then if K and L are regular, so are 
KUL, KL and L*. Finally, no language is regular unless it is constructed from 
the basic languages using a finite number of the operations of union, product and 
star. Since regular languages are also closed under complements, we can rede- 
fine them as the smallest class containing the basic languages and closed under 
boolean operations, product, and star. Star-free languages are the smallest class 
of languages constructed from the basic languages using only boolean operations 
and product. 

A deterministic finite automaton (DFA) is a quintuple A — (Q,E,8,q\,F), 
where Q is a finite, non-empty set of states, E is a finite non-empty alphabet, 
5 : Q x E — > Q is the transition function, q\ G Q is the initial state, and F C Q 
is the set of final states. We extend S to Q x E* in the usual way. The DFA 
A accepts a word w G E* if S(qi,w) G F. The set of all words accepted by A 
is L{A). Regular languages are exactly the languages accepted by DFA's. By 
the language of a state q of A we mean the language L q accepted by the DFA 
(Q, E, S, q, F). A state is empty if its language is empty. 

An incomplete deterministic finite automaton (IDF A) is a quintuple X = 
(Q, E, S, qi,F), where Q, E, qi and F are as in a DFA, and 5 is a partial function 
such that either S(q, a) = p for some p G Q or S(q, a) is undefined for any p,q G Q, 
a G E. Every DFA is also an IDFA. 

The left quotient, or simply quotient, of a language L by a word w is the 
language L w = {x G E* \ wx G L}. For any language L over E, the Nerode 
equivalence ~£ of L is defined as follows [16]: 

x ~l y if and only if xv G L <^> yu G L, for all u G E* . 

Clearly, L x — L y if and only if x V- Thus each equivalence class of the Nerode 
equivalence corresponds to a distinct quotient of L. 

Let L be a regular language. The quotient DFA of L is A — (Q, E, S, qi,F), 
where Q = {L w \ w G E*}, 8{L w ,a) = L wa , qi = L £ = L, and F = {L w \ e G 
L w }. Note that every state L w of a quotient DFA is reachable from the initial 
state L by the word w. Also, the language of every state is distinct, since only 
distinct quotients are used as states. Thus every quotient DFA is minimal. The 
quotient IDFA of L is the quotient DFA of L after the empty state (if present) 
and all transitions incident to it are removed. The quotient IDFA is also minimal. 
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If a regular language L has quotient IDFA I, then the DFA A obtained by 
adding an empty quotient to I, if necessary, is the quotient DFA of L. Conversely, 
if L has quotient DFA A, then the IDFA X obtained from A by removing the 
empty quotient, if present, is the quotient IDFA of L. The two automata A and 
I are equivalent, in the sense that they accept the same language. 

The number k(L) of distinct quotients of L is the quotient complexity of L. 
Since the quotient DFA of L is minimal, quotient complexity is the same as state 
complexity, but there are advantages to using quotients [2]. 

The Myhill equivalence «l of L is defined as follows [15]: 

x ~l y if and only if uxv e Lft uyv £ L for all u, v £ S* . 

This equivalence is also known as the syntactic congruence of L. The quotient 
set S + 1 of equivalence classes of the relation , is a semigroup called 
the syntactic semigroup of L (which we denote by Tl), and S* / is the 
syntactic monoid of L. The syntactic complexity a(L) of L is the cardinality of 
its syntactic semigroup. The monoid complexity /j.(L) of L is the cardinality of 
its syntactic monoid. If the equivalence class containing e is a singleton in the 
syntactic monoid, then cr(L) = fi(L) — 1; otherwise, o-(L) — n{L). 

A partial transformation of a set Q is a partial mapping of Q into itself; we 
consider partial transformations of finite sets only, and we assume without loss 
of generality that Q = {1, 2, . . . , n}. Let t be a partial transformation of Q. If t 
is defined for i £ Q, then it is the image of i under t; otherwise it is undefined 
and we write it = □. For convenience, we let Dt = □. If X is a subset of Q, then 
Xt = {it | i £ X}. The composition of two partial transformations t\ and t^ of 
Q is a partial transformation ti o t 2 such that i(ti o i 2 ) = (iti)*2 for all i £ Q. 
We usually drop the composition operator "o" and write t\ti for short. 

An arbitrary partial transformation can be written in the form 

t _ ( 1 2-n-ln\ 
\i\i2 - ■■ in-i in J ' 

where ik = kt and i k £ Q U {□}, for 1 ^ k ^ n. The domain of t is the set 
dom t = {k £ Q | kt ^ □}. The image of t is the set img t = (dom t)t = {kt 
k £ Q and kt ^ □}. We also use the notation t — . . . , i n ] for the partial 
transformation t above. 

A (full) transformation of a set Q is a total mapping of Q into itself, whereas a 
permutation of Q is a mapping of Q onto itself. In other words, a transformation 
t of Q is a partial transformation where dom t = Q, and a permutation of Q is 
a transformation t where img t = Q. 

The identity transformation maps each element to itself, that is, it = i for 
i = 1, . . . , n. A transformation t contains a cycle of length k if there exist pairwise 
distinct elements i\,...,ik such that i\t = ii,iit = is, . . . ,ik-it = ik, and 
ikt — i\- A cycle is denoted by (i\, ii, ■ ■ ■ , ik)- For i < j, a transposition is 
the cycle (i, j), and (i,i) is the identity. A singular transformation, denoted by 
(*), has it = j and ht = h for all h ^ i, and (*) is the identity. A constant 
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transformation, denoted by (9), has it = j for all i. Let Tq be the set of all full 
transformations of Q. 

In this paper we study full transformations performed by DFA's and partial 
transformations performed by IDFA's. 

Let A = (Q,E,6,q\,F) be a DFA. For each word w G £*, the transition 

function defines a full transformation t w of Q: for all i G Q, it w = S(i,w). The 
set T4 of all such transformations by non-empty words forms a subsemigroup 
of 7q, called the transition semigroup of A [17]. Conversely, we can use a set 
{t a j a G £} of transformations to define S, and so the DFA A. When the context 
is clear we simply write a = t, where t is a transformation of Q, to mean that the 
transformation performed by a G S is t. If A is the quotient DFA of L, then T4 
is isomorphic to the syntactic semigroup Tj, of L [14], and we represent elements 
of Tl by transformations in T4. 

If I is an IDFA, then each word w G -S 1 * performs a partial transformation 
of Q. The set of all these partial transformations is the transition semigroup of I. 
If I is the quotient IDFA of a language L, this semigroup is isomorphic to the 
transition semigroup of the quotient DFA of L, and hence also to the syntactic 
semigroup of L. 

3 Aperiodic Transformations 

A transformation is aperiodic if it contains no cycles of length greater than 1. 
A semigroup T of transformations is aperiodic if and only if it contains only 
aperiodic transformations. Thus a language L with quotient DFA A is star-free 
if and only if every transformation in T4 is aperiodic. 

Each aperiodic transformation can be characterized by a forest of labeled 
rooted trees as follows. Consider, for example, the forest of Fig. 1 (a), where the 
roots are at the bottom. Convert this forest into a directed graph by adding a 
direction from each parent to its child and a self- loop to each root, as shown in 
Fig. 1 (b). This directed graph defines the transformation 

f I 2 3 4 5 6 7\ 
^1445577/ 




Fig. 1. Forests and transformations. 
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and each such transformation is aperiodic since the directed graph has no cycles 
of length greater than one. Thus there is a one-to-one relation between aperiodic 
transformations of a set of n elements and forests with n nodes. 

Proposition 1. For n ^ 1, the number of aperiodic transformations of a set of 
n elements is (n + l) n_1 . 

Proof. By Caley's theorem [6,19], there arc (n + l)" -1 labeled unrooted trees 
with n + 1 nodes. If we fix one node, say node n + 1, in each of these trees to be 
the root, then we have (n + labeled trees rooted at n + 1. Let T be any 

one of these trees, and let v\, . . . , v m be the parents of n + 1 in T. By removing 
the root n + 1 from each such rooted tree, we get a labeled forest F with n 
nodes formed by m rooted trees, where v\,...,v m are the roots. The forest F is 
unique since T is a unique tree rooted at n + 1. Then we get a unique aperiodic 
transformation of {1, ... ,n} by adding self- loops on v\, . . . , v m . 

All labeled directed forests with n nodes can be obtained uniquely from some 
rooted tree with n + 1 nodes by deleting the root. Hence there are (n + 1)™ _1 
labeled forests with n nodes, and that many aperiodic transformations of a set 
of n elements. □ 

Since all the inputs of an aperiodic DFA must perform aperiodic transforma- 
tions, we have the following result: 

Corollary 1. For n > 1, the syntactic complexity cr(L) of a star-free language 
L with n quotients satisfies cr(L) < (n + l) n_1 . 

The bound of Corollary 1 is our first upper bound on the syntactic complexity 
of a star free language with n quotients, but this bound is not tight in general 
because the set A n of all aperiodic transformations is not a semigroup under 
composition for n ^ 3. For example, if a — [1, 3, 1] and b — [2, 2, 1], then ab = 
[2, 1, 2], which contains the permutation (1, 2). Since the set of all transformations 
of a DFA by non-empty words is a semigroup, our task is to find the size of the 
largest semigroup contained in A n . 

First, let us consider small values of n: 

1. If n = 1, the only two languages, and S* , are both star- free, since S* = 0. 
Here a(L) — 1, the bound 2° = 1 of Corollary 1 holds and is tight. 

2. If n = 2, there are 4 transformations possible. Since [2, 1] is a permutation, 
the automata of star-free languages can have at most 3 transformations, and 
the bound 3 1 = 3 of Corollary 1 holds. We can assume that state 1 is initial, 
and state 2 is final, for if the initial and final states are interchanged, we still 
get the same transition semigroup, and hence the same syntactic semigroup. 
Suppose state 2 is reached from state 1 by input a, that is, la = 2; then we 
must have 2a = a, since a must be aperiodic. 

(a) If E = {a}, then the language L is e — aa*, and a(L) = 1. 

(b) If £ = {a, b}, there are three possibilities for b. First, if lb = 2b = 2, 
then this is the same as the case of a one-letter alphabet. Second, if 
lb = 2b = 1, then the language is S*a = 0a, and a(L) = 2. Third, if 
lb = 1 and 2b = 2, then L = E*aE*, and a(L) = 2. 
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(c) If S = {a, b,c}, the language L = (b + c + a(a + c)*b)*a{a + c)* = 
S*a{a + c)* = S*aS*bS* has cr(£) = 3. 



In summary, for n — 1 and 2, the bound of Corollary 1 is tight for \S\ = 1 
and \S\ = 3, respectively. As we shall see later, this is not the case for n ^ 3. 

4 Monotonicity in Transformations, Automata and 
Languages 

In this section we study the syntactic semigroups of languages accepted by mono- 
tonic and related automata. 

We denote by C% the binomial coefficient "n choose k" . 

4.1 Monotonic Transformations, DFA's and Languages 

We show in Section 5 that a tight upper bound for n = 3 is 10, and that this 
bound is met by a monotonic language (defined below). This provides one reason 
to study monotonic automata and languages. A second reason is the fact that 
all the tight upper bounds on the quotient complexity of operations on star-free 
languages are met by monotonic languages [4]. 

A full transformation t of Q is monotonic if there exists a total order ^ on Q 
such that, for all p, q 6 Q, p < q implies pt < qt. From now on we assume that 
the total order is the usual order on integers, and that p < q means that p ^ q 
and p ^ q, as usual. 

Let Mq be the set of all monotonic full transformations of Q. The following 
result [8,11] summarizes some properties of the set of all monotonic transfor- 
mations. It is restated slightly for our purposes, since the work in [8] does not 
consider the identity transformation to be monotonic. 

Theorem 1 (Gomes and Howie). When n ^ 1, the set Mq of all mono- 
tonic full transformations of a set Q of n elements is an aperiodic semigroup of 
cardinality 

n 

|M Q |=/(n) = ^C fe "r 1 1 C fe " = Cf- 1 , 

k=l 

and it is generated by the set H = {a, b±, . . . , b n -i, c} of n + 1 transformations 
ofQ, where 

1. la = 1, ia = i — 1 for 2 ^ i n; 

2. For 1 < i ^ n — 1, ibi = i + and jbi = j for all j ^ i; 

3. c is the identity transformation. 

Moreover, the cardinality of the generating set cannot be reduced. 
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Table 1. The monotonic transformations for n = 3. 
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Example 1. For n = 1 there is only one transformation a = c = [1] and it 
is monotonic. For n = 2, the three generators are a = [1,1], &i = [2,2] and 
c = [1,2], and Mq consists of these three transformations. For n — 3, the four 
generators a = [1,1,2], b\ = [2,2,3], b 2 = [1,3,3], and c = [1,2,3] generate all 
ten monotonic transformations, as shown in Table 1. 

Now we turn to DFA's whose inputs perform monotonic transformations. 
A DFA is monotonic [1] if all transformations in its transition semigroup are 
monotonic with respect to some fixed total order. Every monotonic DFA is ape- 
riodic because a monotonic transformation has no non-trivial permutations, and 
the composition of monotonic transformations is monotonic. A regular language 
is monotonic if its quotient DFA is monotonic. 

Let us now define a DFA having as inputs the generators of Mq: 

Definition 1. For n > 1, let A„ = (Q,S,5,1,{1}) be the DFA in which Q = 
{1, . . . , n}, U = {a, fo„_i, c}, and each letter in U performs the transfor- 

mation defined in Theorem 1. 

DFA A n is minimal, since state 1 is the only accepting state, and for 2 < i ^ n 
only state i accepts a 4 " 1 . From Theorem 1 we have 

Corollary 2. For n 1, the syntactic complexity o~(L) of any monotonic lan- 
guage L with n quotients satisfies o{L) ^ C^"" 1 . Moreover, this bound is met 
by the language L{A n ) of Definition 1, and cannot be met by any monotonic 
language over an alphabet having fewer than n + 1 letters. 

4.2 Monotonic Partial Transformations and IDFA's 

As we shall see, for n > 4 the maximal syntactic complexity cannot be reached 
by monotonic languages; hence we continue our search for larger semigroups of 
aperiodic transformations. 

A partial transformation t of Q is monotonic if there exists a total order ^ 
on Q such that, for all p, q G dom t, p ^ q implies pt < qt. As before, we assume 
that the total order on Q is the usual order on integers. Let PMq be the set 
of all monotonic partial transformations on Q with respect to such an order. 
Gomes and Howie [8] showed the following result (again restated slightly): 
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Theorem 2 (Gomes and Howie). When n ^ 1, the set PMq of all mono- 
tonic partial transformations of a set Q of n elements is an aperiodic semigroup 
of cardinality 

n 

\PM Q \=g{n) = Y J C%CZ +k -\ 

fc=0 

and it is generated by the set I = {a, b\, ... , 6 n _i, ci, . . . , c n -i, d} of 2n partial 
transformations of Q, where 

1- ja = j - 1 for j = 2,...,n; 

2. For 1 ^ i ^ n — I, ibi = i + 1 and jbi = j for j = 1, . . . , i — 1, i + 2, . . . , n; 

3. For 1 ^ i < n — 1, ici = i + 1, and jci = j for all j ^ i; 
4- d is the identity transformation. 

Moreover, the cardinality of the generating set cannot be reduced. 

Example 2. For n — 1, the two monotonic partial transformations arc a = [□], 
d = [1]. For n = 2, the eight monotonic partial transformations are generated by 
a = [□, 1], bi = [2, □], Ci = [2, 2], and d = [1, 2]. For n = 3, the 38 monotonic par- 
tial transformations are generated by a — [0,1,2], b\ = [2, D,3], 62 = [1,3, □], 
ci = [2,2,3], c 2 = [1,3,3] and d = [1,2,3]. 

Partial transformations correspond to IDFA's. For example, the transforma- 
tions a — [D,l], b = [2,D] and c = [2,2] correspond to the transitions of the 
IDFA of Fig. 2 (a). 

An IDFA is monotonic if all partial transformations in its transition semi- 
group are monotonic with respect to some fixed total order. A regular language 
is partially monotonic if its quotient IDFA is monotonic. 

Definition 2. For n ^ 1, let B n = (Q, E,S, 1, {1}) be the IDFA in which Q = 
{1, . . . , n}, S = {a, b\, . . . , 6 n _i, Ci, . . . , c„_i, d}, and each letter in £ performs 
the partial transformation defined in Theorem 2. 

Corollary 3. For n ^ 1, the syntactic complexity o~(L) of any partially mono- 
tonic regular language L with n non-empty quotients has an upper bound o~(L) ^ 
J2k=o Cfr +h ~ 1 . Moreover, this bound is met by the language L{B n ) of Defini- 
tion 2, and cannot be met by any partially monotonic language over an alphabet 
having fewer than 2n letters. 



b 




(a) a ' 6 ' c (b) 



Fig. 2. Partially monotonic DFA's: (a) incomplete; (b) complete. 
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We can also reformulate this result in terms of quotient DFA's. In the follow- 
ing, we let Q' = {l,...,n - 1}, and Q = Q' U {n}. A quotient DFA A = 
(Q,S,S,q\,F) is partially monotonia if the corresponding quotient IDFA is 
monotonic. So a regular language is partially monotonic if its quotient DFA 
is partially monotonic. 

Example 3. If we complete the transformations in Fig. 2 (a) by replacing the 
undefined entry □ by a new empty (or "sink") state 3, as usual, we obtain the 
DFA of Fig. 2 (b). That DFA is not monotonic, because 1 < 2 implies 2 < 3 
under input 6, and 3 < 1 under a, which in turn implies that 3 < 2 under b. 
A contradiction is also obtained if we assume that 2 < 1. However, this DFA is 
partially monotonic, since removal of the empty state 3 results in the IDFA of 
Fig. 2 (a), whose partial transformations are monotonic. 

The DFA of Fig. 3 has an empty state, and it is monotonic for the order 
shown. The IDFA consisting of states 2, 3, and 4 is also monotonic. 



Fig. 3. Partially monotonic DFA that is monotonic and has an empty state. 

The transition semigroup of the DFA of Fig. 3 consists of the transformations 
a = [1,3,3,3], b = [1, 1,4,4], ab = [1,4,4,4], and ba = [1, 1,3,3]. The transition 
semigroup of the corresponding IDFA with state set {2, 3, 4} consists of a = 
[3,3,3], b = [0,4,4], ab — [4,4,4], and ba = [0,3,3], and the two semigroups 
are isomorphic. 

Now consider the following semigroup CMq of monotonic completed trans- 
formations on a set Q = {1, ...,n}. Start with the semigroup PMqi of all 
monotonic partial transformations on Q' = {1, . . . , n — 1}. Convert the partial 
transformations to full by adding n, replacing all the O's by n's, and letting the 
image of n be n. As we have noted before, there is one-to-one correspondence 
between the monotonic partial transformations of Q' and the corresponding full 
transformations of Q. Thus semigroups CMq and PMqi are isomorphic. 

Example 4- For n = 2, the two monotonic completed transformations are a = 
[2,2], d = [1,2]. For n = 3, the eight monotonic completed transformations are 
generated by a = [3, 1,3], &i = [2,3,3], a = [2,2,3], and d = [1,2,3]. 

Definition 3. For n > 1, let B' n = (Q, S, S, 1, {1}) be the DFA in which Q = 
{1, . . . , n}, S — {a, 6„_2, Ci, . . . , c„_2, d}, and each letter in E defines a 

transformation as follows: 



a, b 



a 



b 
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1. la = na = n, and ja = j — 1 for j = 2, . . . , n — 1; 

2. For 1 ^ i < n — 2, ibi = i + 1, (i + l)bi = n, and jbi = j for j = 1, . . . , i — 
l,i + 2, ...,n; 

3. For I ^ i ^ n — 2, ic, = i + 1, and jci — j for all j ^ i; 
4- d is the identity transformation. 

Theorem 2 then implies the following result: 

Corollary 4. For n ^ 1, the syntactic complexity <r(L) of any partially mono- 
tonic language L with n quotients satisfies u{L) «C g(n—l) = X)fc=o Ck~ 1 C^ +k ~ 2 . 
Moreover, this bound is met by the language L(B' n ) of Definition 3, and cannot 
be met by any partially monotonic language over an alphabet having fewer than 
2n — 2 letters. 

4.3 Nearly Monotonic Transformations and DFA's 

Partially monotonic languages still don't reach the maximal bound for syntactic 
complexity, so we continue the search for larger semigroups of aperiodic trans- 
formations. 

Let NMq be the union of CMq and the set of all constant transformations 
of Q; we shall call these transformations nearly monotonic with respect to the 
usual order on integers. 

Theorem 3. When n ^ 2, the set NMq of all nearly monotonic transforma- 
tions of a set Q of n elements is an aperiodic semigroup of cardinality 

n-l 

\NM Q \ = h(n) = ]T C^C^- 2 + n - 1, 

fc=0 

and it is generated by the set J = {a, b±, . . . , &n-2, c±, . . . , c„_2, d, e} of 2n — 2 
transformations ofQ, where 

1. la = na = n, and ja = j : — 1 for j = 2, . . . , n — 1; 

2. For I ^ i ^ n — 2, ibi = i + I, (i + = n, and jbi — j for j = 1, . . . , i — 
l,i + 2,...,n; 

3. For 1 < i ^ n — 2, ici = i + and jci = j for all j ^ i; 
4- d is the identity transformation; 

5. e is the constant transformation (^) . 

Moreover, the cardinality of the generating set cannot be reduced. 

Proof. Let /' = {a, b\, . . . , b n -2, c\, . . . , c„_2, d}. We know from [8] that the 
semigroup CMq — (I') generated by /' is isomorphic to PMqi, where Q 1 = 
{1, . . . , n - 1}. Let J = I' U {e}, and if = (J) \ (/')• 

The composition of a constant transformation with any transformation is 
a constant. Thus any transformation in NMq is either constant or is in the 
semigroup CMq. Since constant transformations are aperiodic, and CMq is 
aperiodic, NMq is also aperiodic. 
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Note that K contains only constant transformations. Since (/') ~ PMqi, for 
all j G Q, Sj = («')(") G (I') C (J). Thus es, = ( Q ) G (J) for all 1 < j < n. 
Among these constant transformations, only (^) G (/'); so K contains (9) for 
1 < j ^ n - 1 and \K\ = n - 1. Thus ft(n) = |(J)| = |PM Q /| + n - 1. 

Since the cardinality oil' cannot be reduced, and e ^ (/'), also the cardinality 
of J cannot be reduced. □ 

Example 5. For n = 2, the three nearly monotonic transformations are a = [2, 2], 
d = [1, 2] and e = [1, 1]. For n = 3, the ten nearly monotonic transformations are 
generated by a = [3,l,3],&i = [2,3,3], a = [2,2,3],d = [1, 2, 3], and e = [1,1,1]. 

Let .4 = (Q, S, S, qi, F) be a DFA. An input a G £ is constant if <5(p, a) = 
a) for all p,q € Q, that is, a performs a constant transformation of Q. Then 
.4 is nearly monotonic if, after removing constant inputs, the resulting DFA A' 
is partially monotonic. A regular language is nearly monotonic if its quotient 
DFA is nearly monotonic. 

Definition 4. For n ^ 2, let C n = (Q, E, 5, 1, {1}) be a DFA, where Q = 
{1, ...,n}, 17 = {a, &i, . . . , b n -2, Ci, . . . , c„_2, d, e}, anrf eac/i tetter m S per- 
forms the transformation defined in Theorem 3. 

Theorem 3 now leads us to the following result: 

Theorem 4. For n > 2, the syntactic complexity a{L) of any nearly monotonic 
language L with n quotients satisfies cr(L) ^ h(n) = X)fc=o C'k~ 1 CJ!: +k ~ 2 +n — l. 
Moreover, this bound is met by the language L(C n ) of Definition 4, and cannot 
be met by any nearly monotonic language over an alphabet having fewer than 
2n — 1 letters. 

Proof. Note that 1 is the initial state of C n . For 2 ^ i ^ n — 1, state i is 
reached by Wi = 6i • • • State n is reached by w n -ib n ~2- Thus all states 
are reachable. For 1 < i < n — 1, state i accepts the word a 4-1 , while all 
other states reject it. Also, state n rejects a % for all i ^ 0. So all n states 
are distinguishable, and C„ is minimal. Therefore L has quotient complexity 
k(L) — n. The syntactic semigroup of L is generated by J, and so L has syntactic 
complexity a(L) = h(n) = YJ^l C%~ 1 C% +k - 2 + n — 1. □ 

Although we cannot prove that NMq is the largest semigroup of aperiodic 
transformations, we can show that no transformation can be added to NMq 
without destroying aperiodicity. 

A set S = {Ti,T 2 , ■ ■ - ,Tk} of transformation semigroups is a chain if T\ C 
T<i C • • • C Tfc. Semigroup is the largest in S, and we denote it by max(S') = 
Tfc. The following result shows that the syntactic semigroup T L ^ Cn) = Tc n of 
L(C n ) in Definition 4 is a local maximum among aperiodic subsemigroups of 7q. 

Proposition 2. Let S be a chain of aperiodic subsemigroups ofTq. IfTc n G S, 
then Tc n — max(5). 
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Proof. Suppose max(5) = T k for some aperiodic subsemigroup T k of Tq, and 
Tk 7^ Tc n . Then there exists t G T k such that t Tc n ; thus there exists i,j G Q 
such that i < j ^ n but it > jt, and it,jt ^ n. Let r G 7q be the transformation 
of Q such that (jt)r = i, (it)r = j, and hr = n for all h ^ then r G Tc n - 
Let A G 7q be such that iX = i, jX = j, and hX = n for all h ^ then also 
A G T Cn . Since T k = max(S), T Cn C T k and r, A G T k . Then s = Xtr is also in 
Tfc. However, is = i(Xtr) = j, js = j(Xtr) = i, and hs — n for all h ^ then 
s = {i,j)(n)-> where P = Q\ {i, j}, is not aperiodic, a contradiction. Therefore 
T c „ = max(5). □ 

4.4 Containment and Closure Properties 

Let Lm, Lpmj IL^vm , and L,sf be the classes of monotonic, partially monotonic, 
nearly monotonic and star-free languages. Then the following holds: 

Proposition 3. 

Lm £ Lpm Si L^m Si Lgir. 

Proof. The DFA of Fig. 2 (b) is partially monotonic but not monotonic. If we add 
to that DFA the input d = [1,1,1], then the new DFA is nearly monotonic but 
not partially monotonic. The DFA of Figure 4 is aperiodic, as one can verify. 
It has no constant input; hence it must be partially monotonic if it is nearly 
monotonic. Since it has no empty state, it must be monotonic if it is partially 
monotonic. However, if 1 < 2. then 3 < 2 by a, and also 2 < 3 by b. We get a 
similar contradiction if we set 2 < 1. Therefore A is not monotonic. □ 

Proposition 4. The classes Lm, ^pm, andl,NM are closed under complemen- 
tation and left quotients, but not under union, symmetric difference, intersection, 
difference, concatenation and star. 

Proof. Closure under complementation is obvious, since the properties of mono- 
tonicity do not involve final states. Also, the left quotient of any monotonic 
language is defined by a DFA that is a subautomaton of the original language, 
and closure under quotients follows. 

The language of the DFA of Figure 4 is not nearly monotonic. However, it 
is the union of languages L x = a(e U E*b) and L 2 = bS*b. The DFA of L x is 
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shown in Figure 3, and it is monotonic. One verifies that Li is also monotonic. 
This proves that each of the classes is not closed under union. Since L\ and 
Li arc disjoint, each class is not closed under symmetric difference. Since each 
class is closed under complement, it can be closed under neither intersection nor 
difference. 

Consider now DFA's: A 3 = ({1, 2, 3}, {a, b}, 5 3 , 1, {2}), where a = [1,1,1] 
and b = [2,3,3], and At = ({1, 2, 3}, {a, b}, 64, 1, {2}), where a = [1,1,3] and 
b = [2,3,3]. Both A3 and A4 are monotonic. Let L = L(A3)L{Ai). Then L 
is star-free with quotient DFA A = ({1, 2, 3, 4, 5, 6}, {a, &}, S, 1, {3, 6}) shown in 
Figure 5, where a = [1,5,5,1,5,5] and b = [2,3,4,4,6,3]. 



a 




Fig. 5. Quotient DFA A of concatenation of partially monotonic DFA's A3 and Aa- 

However, A is not nearly monotonic. If 1 < 6, then l<5bya,2<3by input 
6, 3 < 4 by b again, and 5 < 1 by input a, which is a contradiction. We get a 
similar contradiction by assuming 5 < 1 . Hence the class of partially monotonic 
DFA's is not closed under product. 

Lack of closure under star follows since each class contains the language {aa}, 
and the star of {aa} is not star-free. □ 

5 Syntactic Complexity of Star-Free Languages for n = 3 

We now prove that the bound reached by monotonic automata is an upper bound 
for star- free languages with n = 3. 

For n = 3, there are 3 3 = 27 possible transformations. The transformations 
[2, 3, 1], [3, 1, 2] are cycles of length 3; whereas 

[1, 3, 2], [2, 1, 1], [2, 1, 2], [2, 1, 3], [2, 3, 2], [3, 1, 1], [3, 2, 1], [3, 3, 1], [3, 3, 2] 

have cycles of length 2. This leaves 16 aperiodic transformation as potentially 
possible for star-free languages. 

We say that two aperiodic transformations a and b conflict if ab or ba has a 
cycle. Of the 16 transformations, transformations [1, 2, 3], [1,1,1], [2, 2, 2], [3, 3, 3] 
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cannot create any conflicts. Hence we consider only the remaining 12; their 
multiplication table is shown in Table 2, where the bold face entries indicate 
conflicting pairs. Since each of the 12 transformations appears as a product of 
some other transformations, none is essential. 

Table 2. Conflicting pairs for n = 3. 
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Each of the following six transformations has only one conflict: 

[1,1,3], [1,2,1], [1,2,2], [1,3,3], [2,2,3], [3, 2,3]. 

There are also two conflicting triples shown in dotted lines in the conflict graph of 
Fig. 6: ([1,1,3], [1,2,2], [3,2,3]) and ([1,2,1], [1,3,3], [2,2,3]); normal lines show 



( 113 } ---- { 122 } ---- { 323 ] { 121 } ---- {" 133 } ---- {223~] 
( 322 ) [ 313 ) ( 112 ) ( 233 ) ( 221 ) ( 131 ) 



Fig. 6. Conflict graph for n = 3. 

conflicting pairs. We can choose at most two inputs from each triple and at 
most one from each conflicting pair. Hence there are at most 6 conflict-free 
transformations from Table 2, for example, 

[1,1,2], [2,2,3], [1,3,3], [1,1,3], [1,2,2], [2,3,3]. 
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Adding the identity and the three constant transformations, we get a total of 
at most 10. The inputs shown in Table 1 are conflict-free and generate precisely 
these 10 transformations. Hence a(L) ^ 10 for any L with n = 3. 

6 Conclusions 

We conjecture that the syntactic complexity of languages accepted by the nearly 
monotonic automata of Definition 4 meets the upper bound for star-free lan- 
guages in general: 

Conjecture 1 The syntactic complexity of a star-free language L with k(L) = 
n 2 satisfies o~(L) $C h(n), and this bound is tight. 

Our results are summarized in Table 3. Let Q = {1, . . . , n}, and Q' — Q\{n}. 
The figures in bold type are tight bounds verified using GAP [7], by enumerating 
aperiodic subsemigroups of 7q . The asterisk * indicates that the bound is already 
tight for a smaller alphabet. The last four rows show the values of f(n) — \Mq\, 
g(n — 1) = \PMq>\, the conjectured tight upper bound h(n) = \PMq>\ + n — 1, 
and the weaker upper bound (n + l)™ -1 (not tight in general). 



Table 3. Syntactic complexity of star-free languages. 
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